智能论文笔记

A Survey on Medical Document Summarization

Raghav Jain , Anubhav Jangra , Sriparna Saha , Adam Jatowt

分类：自然语言处理

2022-12-03

The internet has had a dramatic effect on the healthcare industry, allowing documents to be saved, shared, and managed digitally. This has made it easier to locate and share important data, improving patient care and providing more opportunities for medical studies. As there is so much data accessible to doctors and patients alike, summarizing it has become increasingly necessary - this has been supported through the introduction of deep learning and transformer-based networks, which have boosted the sector significantly in recent years. This paper gives a comprehensive survey of the current techniques and trends in medical summarization

translated by 谷歌翻译

TimeBERT: Extending Pre-Trained Language Representations with Temporal Information

Jiexin Wang , Adam Jatowt , Masatoshi Yoshikawa

分类：自然语言处理

2022-04-27

时间是文档的重要方面，用于一系列NLP和IR任务。在这项工作中，我们研究了在预训练期间合并时间信息的方法，以进一步提高与时间相关的任务的性能。与Bert相比，使用同步文档收集（BooksCorpus和English Wikipedia）作为培训语料库相比，我们使用长跨度的时间新闻文章集合来构建单词表示。我们介绍了Timebert，这是一种新颖的语言表示模型，该模型通过两项新的预训练任务培训了新闻文章的临时收集，这些任务利用了两个不同的时间信号来构建时间认识的语言表示。实验结果表明，TimeBert始终胜过BERT和其他现有的预训练模型，在不同的下游NLP任务或应用程序上，时间很高的时间很重要。

translated by 谷歌翻译

Change Summarization of Diachronic Scholarly Paper Collections by Semantic Evolution Analysis

Naman Paharia , Muhammad Syafiq Mohd Pozi , Adam Jatowt

分类：自然语言处理

2021-12-07

在过去几年中，学术数据的数量一直在急剧增加。对于特定科学领域的新人（例如，IR，物理学，NLP）往往难以解决更大的趋势，并在先前科学成就和突破的背景下定位最新研究。同样，科学史上的研究人员对允许他们分析和可视化特定科学域中的变化的工具感兴趣。时间摘要和相关方法应该是有用的，以使大量的科学话语数据随时间汇总。我们展示了一种新颖的分析研究论文收集的方法，在较长的时间内发布，以提供在时间进展情况上发生的重要语义变革的高级概述。我们的方法是基于比较单词语义表示随着时间的推移，并旨在支持用户更好地理解学术出版物的大型域名档案。作为一个示例数据集，我们使用从1979年到2015年的ACL原点参考语料库，并包含22,878篇学术文章。

translated by 谷歌翻译

Through-life Monitoring of Resource-constrained Systems and Fleets

Felipe Montana , Adam Hartwell , Will Jacobs , Visakan Kadirkamanathan , Andrew R Mills , Tom Clark

分类：机器学习

2023-01-03

A Digital Twin (DT) is a simulation of a physical system that provides information to make decisions that add economic, social or commercial value. The behaviour of a physical system changes over time, a DT must therefore be continually updated with data from the physical systems to reflect its changing behaviour. For resource-constrained systems, updating a DT is non-trivial because of challenges such as on-board learning and the off-board data transfer. This paper presents a framework for updating data-driven DTs of resource-constrained systems geared towards system health monitoring. The proposed solution consists of: (1) an on-board system running a light-weight DT allowing the prioritisation and parsimonious transfer of data generated by the physical system; and (2) off-board robust updating of the DT and detection of anomalous behaviours. Two case studies are considered using a production gas turbine engine system to demonstrate the digital representation accuracy for real-world, time-varying physical systems.

translated by 谷歌翻译

Detection of out-of-distribution samples using binary neuron activation patterns

Bartlomiej Olber , Krystian Radlak , Adam Popowicz , Michal Szczepankiewicz , Krystian Chachula

分类：机器学习

2022-12-29

Deep neural networks (DNN) have outstanding performance in various applications. Despite numerous efforts of the research community, out-of-distribution (OOD) samples remain significant limitation of DNN classifiers. The ability to identify previously unseen inputs as novel is crucial in safety-critical applications such as self-driving cars, unmanned aerial vehicles and robots. Existing approaches to detect OOD samples treat a DNN as a black box and assess the confidence score of the output predictions. Unfortunately, this method frequently fails, because DNN are not trained to reduce their confidence for OOD inputs. In this work, we introduce a novel method for OOD detection. Our method is motivated by theoretical analysis of neuron activation patterns (NAP) in ReLU based architectures. The proposed method does not introduce high computational workload due to the binary representation of the activation patterns extracted from convolutional layers. The extensive empirical evaluation proves its high performance on various DNN architectures and seven image datasets. ion.

translated by 谷歌翻译

Joint Action is a Framework for Understanding Partnerships Between Humans and Upper Limb Prostheses

Michael R. Dawson , Adam S. R. Parker , Heather E. Williams , Ahmed W. Shehata , Jacqueline S. Hebert , Craig S. Chapman , Patrick M. Pilarski

分类：人工智能 | 机器人

2022-12-28

Recent advances in upper limb prostheses have led to significant improvements in the number of movements provided by the robotic limb. However, the method for controlling multiple degrees of freedom via user-generated signals remains challenging. To address this issue, various machine learning controllers have been developed to better predict movement intent. As these controllers become more intelligent and take on more autonomy in the system, the traditional approach of representing the human-machine interface as a human controlling a tool becomes limiting. One possible approach to improve the understanding of these interfaces is to model them as collaborative, multi-agent systems through the lens of joint action. The field of joint action has been commonly applied to two human partners who are trying to work jointly together to achieve a task, such as singing or moving a table together, by effecting coordinated change in their shared environment. In this work, we compare different prosthesis controllers (proportional electromyography with sequential switching, pattern recognition, and adaptive switching) in terms of how they present the hallmarks of joint action. The results of the comparison lead to a new perspective for understanding how existing myoelectric systems relate to each other, along with recommendations for how to improve these systems by increasing the collaborative communication between each partner.

translated by 谷歌翻译

A Generalization of ViT/MLP-Mixer to Graphs

Xiaoxin He , Bryan Hooi , Thomas Laurent , Adam Perold , Yann LeCun , Xavier Bresson

分类：计算机视觉

2022-12-27

Graph Neural Networks (GNNs) have shown great potential in the field of graph representation learning. Standard GNNs define a local message-passing mechanism which propagates information over the whole graph domain by stacking multiple layers. This paradigm suffers from two major limitations, over-squashing and poor long-range dependencies, that can be solved using global attention but significantly increases the computational cost to quadratic complexity. In this work, we propose an alternative approach to overcome these structural limitations by leveraging the ViT/MLP-Mixer architectures introduced in computer vision. We introduce a new class of GNNs, called Graph MLP-Mixer, that holds three key properties. First, they capture long-range dependency and mitigate the issue of over-squashing as demonstrated on the Long Range Graph Benchmark (LRGB) and the TreeNeighbourMatch datasets. Second, they offer better speed and memory efficiency with a complexity linear to the number of nodes and edges, surpassing the related Graph Transformer and expressive GNN models. Third, they show high expressivity in terms of graph isomorphism as they can distinguish at least 3-WL non-isomorphic graphs. We test our architecture on 4 simulated datasets and 7 real-world benchmarks, and show highly competitive results on all of them.

translated by 谷歌翻译

Beyond 5G Networks: Integration of Communication, Computing, Caching, and Control

Musbahu Mohammed Adam , Liqiang Zhao , Kezhi Wang , Zhu Han

分类：机器学习

2022-12-26

In recent years, the exponential proliferation of smart devices with their intelligent applications poses severe challenges on conventional cellular networks. Such challenges can be potentially overcome by integrating communication, computing, caching, and control (i4C) technologies. In this survey, we first give a snapshot of different aspects of the i4C, comprising background, motivation, leading technological enablers, potential applications, and use cases. Next, we describe different models of communication, computing, caching, and control (4C) to lay the foundation of the integration approach. We review current state-of-the-art research efforts related to the i4C, focusing on recent trends of both conventional and artificial intelligence (AI)-based integration approaches. We also highlight the need for intelligence in resources integration. Then, we discuss integration of sensing and communication (ISAC) and classify the integration approaches into various classes. Finally, we propose open challenges and present future research directions for beyond 5G networks, such as 6G.

translated by 谷歌翻译

Exploring the Optimized Value of Each Hyperparameter in Various Gradient Descent Algorithms

Abel C. H. Chen

分类：机器学习

2022-12-23

In the recent years, various gradient descent algorithms including the methods of gradient descent, gradient descent with momentum, adaptive gradient (AdaGrad), root-mean-square propagation (RMSProp) and adaptive moment estimation (Adam) have been applied to the parameter optimization of several deep learning models with higher accuracies or lower errors. These optimization algorithms may need to set the values of several hyperparameters which include a learning rate, momentum coefficients, etc. Furthermore, the convergence speed and solution accuracy may be influenced by the values of hyperparameters. Therefore, this study proposes an analytical framework to use mathematical models for analyzing the mean error of each objective function based on various gradient descent algorithms. Moreover, the suitable value of each hyperparameter could be determined by minimizing the mean error. The principles of hyperparameter value setting have been generalized based on analysis results for model optimization. The experimental results show that higher efficiency convergences and lower errors can be obtained by the proposed method.

translated by 谷歌翻译

Human Activity Recognition in an Open World

Derek S. Prijatelj , Samuel Grieggs , Jin Huang , Dawei Du , Ameya Shringi , Christopher Funk , Adam Kaufman , Eric Robertson , Walter J. Scheirer

分类：计算机视觉

2022-12-23

Managing novelty in perception-based human activity recognition (HAR) is critical in realistic settings to improve task performance over time and ensure solution generalization outside of prior seen samples. Novelty manifests in HAR as unseen samples, activities, objects, environments, and sensor changes, among other ways. Novelty may be task-relevant, such as a new class or new features, or task-irrelevant resulting in nuisance novelty, such as never before seen noise, blur, or distorted video recordings. To perform HAR optimally, algorithmic solutions must be tolerant to nuisance novelty, and learn over time in the face of novelty. This paper 1) formalizes the definition of novelty in HAR building upon the prior definition of novelty in classification tasks, 2) proposes an incremental open world learning (OWL) protocol and applies it to the Kinetics datasets to generate a new benchmark KOWL-718, 3) analyzes the performance of current state-of-the-art HAR models when novelty is introduced over time, 4) provides a containerized and packaged pipeline for reproducing the OWL protocol and for modifying for any future updates to Kinetics. The experimental analysis includes an ablation study of how the different models perform under various conditions as annotated by Kinetics-AVA. The protocol as an algorithm for reproducing experiments using the KOWL-718 benchmark will be publicly released with code and containers at https://github.com/prijatelj/human-activity-recognition-in-an-open-world. The code may be used to analyze different annotations and subsets of the Kinetics datasets in an incremental open world fashion, as well as be extended as further updates to Kinetics are released.

translated by 谷歌翻译